Character-reading apparatus including improved character set sensing structure

ABSTRACT

A reading machine, including devices to pick up the signals of group identification points for effectively differentiating configurations, is designed for use independently or in conjunction with a coordinate point selection apparatus. A coordinate matrix of photocells, the number and size of which are dependent upon the type of configurations to be analyzed scan each configuration recording for each the presence and absence of writing in the area viewed by each of the cells in a coordinate system. The number of &#39;&#39;&#39;&#39;written&#39;&#39;&#39;&#39; areas are then added numerically on a coordinate basis for all of the letter configurations. A combinatorial constant n, where 2n equals the total number of letter configurations, is derived and dictates the use of a specific stored combination chart. All those coordinate points have the totals 1 to n-1 (where n is the number of configurations) are separately stored and compared intra se to select those within each group which are unique. A selective choice is then made of predetermined groups to obtain a combination of unique subgroups equal to n. This is the group identification pattern or set of points unique to the particular number and type of configurations sought to be recognized.

United States Patent [72] Inventor Hsing Chu Lee Primary Examiner-Maynard R. Wilbur 219 W. 106th St., New York, N.Y. 10025 Assistant Examiner-William W. Cochran [21] Appl. No. 790,811 AnorneyHopgood and Calimafde [22] Filed Jan. 13, 1969 [45] Patented Dec. 7, 1971 ABSTRACT: A reading machine, including devices to pick up the signals of group identification points for effectively dif- [54] CHARACTER-READING APPARATUS INCLUDING ferentiating configurations, is designed for use independently IMPROVED CHARACTER SET SENSING or in conjunction with a coordinate point selection apparatus. STRUCTURE A coordinate matrix of photocells, the number and size of 3Claims, 18 Drawing Figs.- which are dependent upon the type of configurations to be analyzed scan each configuration recording for each the [52] US. Cl ..340/l46.3 F, presence and absence of writing in the area viewed y each of 340/1463 T, 340/1463 Z, 340/1463 S,

the cells in a coordinate system. The number of written 250/219 CR, 340/1725 I 5 n Int Cl I 606k 9/12 areas are then added numerically on a coordinate basis for all [so] Fie'ld 340 46 3 of the letter configurations. A combinatorial constant n, where 2" equals the total number of letter configurations, is 56 R i C" d derived and dictates the use of a specific stored combination I I e erences e chart. All those coordinate points have the totals 1 to n-l UNITED STATES PATENTS (where n is the number of configurations) are separately 3,106,699 10/1963 Kamentsky 340/1463 stored and compared intra se to select those within each group 3,177,470 4/1965 Galopin 340/1463 which are unique. A selective choice is then made of predeter- 3,192,505 6/1965 Rosenblatt. 340/1463 mined groups to obtain a combination of unique subgroups 3,275,986 9/1966 Dunn et al.. 340/1463 equal to n. This is the group identification pattern or set of 3,295,103 12/1966 Driese et al. 340/1463 points unique to the particular number and type of configura- 3,412,255 l [/1968 Krieger 340/1463 tions sought to be recognized.

1 :1 "EL "3 }--u|i;,-\IPAR.-1TOR}- cameo/m7: 1 a

sronss I .70 Mi 4 4/, CONFIGURA E i M 1 m/vwl. MAIF/GURAT/GV a I T2 42 "'4 Tk/x COORDINATE LWMFMRATQR mllfm ADDER T :3 4 ali 4 ELECMR raw/4594110 I 5 M70? ;7 43

MAM/4L a/NARY cons/NATION maxim lD'Nf/F/CdT/OA/ 30 J7 coma/Mares PATENTEU DEC 7 am SHEET 2 OF 5 OOOOOO OOOOQOOOO 0000 000 0000 000 00800000000000 00000000 0000000 000000000000000 000000000000000 Y 000000 00 000 0 000 000 00000000000 000000 000000000 000000 00000 0 000 00000 0 00000000 456759Mflfl006 x FIGZb PMCEEING INVENTOR. l/SING CHU LEE ATTORNEXS' SIGMA L SELECT 0N PATENTEDUEC mm 1162635 SHEET 5 OF 5 F I INVENTOR.

AIS/N6 CHO LEE BY Arman-r5 The above-mentioned and other features and objects of this 3 626 ,3 6 8 1 2 CHARACTER-READING APPARATUS INCLUDING FIGS. 2a-2d, including 20', illustrate four machines IMPROVED CHARACTER SET SENSIN TR E designed to automatically read configurations;

FIGS. 3-6, including 4a, illustrate graphically the progres- BACKGROUND OF THE INVENTXON sion of steps according to the inventive method;

This invention relates t h t di hi 5 FIGS. 7a-7d illustrate some possible combinations of three generally, and, in particular, to a method for automatically Seis; deriving an identification format or set for the particular con- 8 illustrates one Possible combination for eigh Sets? figuration of letters, figures, etc., under consideration. and

The reading machine art has now become well developed to FIGS. 9 and 10 show identification sets for numerals and the point where a great number of systems exist for the letters of one type font, respectively. identification of written and printed characters (generally DETAILED DESCRIPTION OF THE INVENTION referred to hereinafter as "configurations" to embrace under one genus all possible written and printed species). The machines function in a wide variety of operating modes depending upon the particular font being recognized and the flexibility and versatility of the machine.

Some common types of apparatus include mechanisms which spot scan, line scan, or area scan. Functionally, there are mechanisms which compare read data against stored data, mechanisms which use masking techniques, mechanisms which employ Boolean logic, and so on.

Regardless of what type of arrangement is used, in order to build in greater flexibility and versatility, conventional devices The invention shall now be described in detail, first with respect to its method, and then with respect to a specific apparatus for accomplishing the foregoing method, and then with respect to machines for utilizing the identification points derived by the method.

The description which follows is directed at the procurement of a set or group of identification points, these points defining such mutual differences between the letter characters (the configurations chosen) such that the identity of each character may be unambiguously determined.

Consider, for example, three sets, A, B and C (including the possibility of zero, it is four). They may intersect several diftend to be extremely sophisticated in circuitry and expensive ferem ways, as FIGS 7 7 Show Dividing the whole area to build and maimain- To give an p Where a greai into a, b, c, ab, ac, hr, and abc and n: Set A (similarly is the y of fonts configurations are to be analyzed com/en case with the sets B and C) in FIG. 7a has a nonoccupancy of tional arrangements may include a whole mosaic of photocells areas b, be, and c. Thus, two or three points may be chosen in conjunction with sophisticated logic or masking circuitry from particular areas in order to identify the sets (no more which must be programmed in great detail in order to analyze than one point being chosen for the same area) and the group the configurations under consideration. identification is as follows:

TABLE I Accordingly, it is the object of this invention to provide an A point can be chosen from each area (total seven points) accurate and low-cost re ognition machine for written and without simplification to also serve the identification purpose printed characters. as follows:

It is a further object of this invention to provide a group identification method and apparatus for automatically deter- TABLE 11 mining the simplest and most efficient identification set or read-head format for a particular family of configurations. abc It is a further object of this invention to provide a method of X the foregoing type easily employed directly with reading X machines, i.e., which speaks a common machine language and X is therefore adaptable to direct adjunct use. r

It is a further object of this invention to provide a method and apparatus according to the foregoing object which is ex- For ldemfficauon purpPses' f f "relevant to namely versatile and which is adaptable to most configura choose a point from abc sincethis area IS the intersection of ions including letters figures em the three sets and has no identification value. If the three sets Briefly the invention is predicated upon a method and intersect as in FIG. 7b, the only choice IS aband be (two paratus for storing, in a coordinate system, each of the particular configurations in the family under consideration, and then comparing the configurations intra se, mathematically, to determine the minimum number or most efficient group identification points (group identification set) which will uniquely recognize each of the configurations in the family, and recognition machines to read the sets.

and 7d, at least three points are necessary. From FIG. 7d, it is apparent that the choice is one from each set.

Consider, for example, eight sets as in FIG. 8. The areas of intersection and nonintersection are a, ab, abc ...g. There is a mathematically necessary number of points or spots in order to identify the set. Simply expressed, this number may be represented by n where 2" z the number of sets (in the foregoing the number of sets is equivalent and used interchangeably invention and the manner of attaining them will become more with the number f configurations or Enemy apparent and the invention itself will best be understood by Assuming h i h sets b k d as Shown i FIG, 8,

reference to the following description of embodiments of the several identification points (which make up one identificainvention taken in conjunction with the accomp nying tion set) may be chosen in order to unambiguously identify the drawings. th description f whi h f Whefeifl character; sets. For example, table III shows some of the mm FIG. I is a block schematic diagram f n embodimen! 0f binations which may be chosen assuming four identification th e invention; points.

points). If, on the other hand, the sets intersect as in FIGS. 7c I TABLE III ab bcd cef fg ad bed cef f ah bed ccf ceg It is to be understood that the number of points of identifileast a three spot chart would be necessary. Included then in cation may be increased for the best choice or to agree with the available combinations would be the top five lines where the machine language. Accordingly, five points shown in table the sum is l, 2, 2; lines 3 to 6 where the sum is 2, 2, 2; or lines lV may be chosen to identify the character sets. Even more through 8, where the sum is 3, 3, 3. spots can be derived where desired. In order to clarify the invention still further, an operative example will now be described in which six Hebrew letters (FIG. 3) will be operated upon in order to automatically choose a group identification set for unambiguously identifying the character. For simplicity of reading, underneath each Hebrew character is a rough English equivalent for discussion purposes.

The following apparatus to be described is computer in type, and while the computer stages and the relationships between them are specifically shown and described in block format, and an analytical analysis of each of the stages is also set forth in detail, it will be appreciated by those skilled in the art that a description of the details of the computer com- Generally, the number of identification points derived from ponents would only encumber this description, and the selecthe analysis in order to identify all of the individual configurai f h d vices is purely mechanical. tions Varies depending upon: the number Ofcohfigufations In accordance with the invention, each of the Hebrew TAB LE IV to be recognized; (h) the Style of the configurations; and (C) characters is sequentially scanned by a light-sensitive the Size of the P It win he appteciated that one P p of photocell matrix 10 as shown in FIG. I. The photocell matrix the invention is to minimize the number of identification is made up f a coordinate array f h ll h b r points tothus economize the associated circuitry in the readand size f which are l d d d t on the complexity mg machlheofthe characters and the capacity of the cells involved.

The Shape and Size of the Points depend p the Pickup The output of the photocell matrix 10 is fed via a sequencer devicesthe P p of this disclosure it will be 12, which may operate manually or automatically with the adderstood that any type Pickup may be used including most of vancement of the respective Hebrew characters to coordinate the Variety of Scan devices available Oh the market The stores 14 through 19 (additional coordinate stores are, of Smaller the identification Spot may the Smaher may be the course, necessary for larger size character sets; however, they number of identification P h that are necessary the will not be needed for this example). Each coordinate store greateh the number of posstbte gFQUPS t uhambtguousty can include, for example, a matrix of ferrite cores equivalent determine the character, thus making available cross-check in number and position to the photoceus The Cells are groups for ldemlficauon' In any case the paliucular referenced to the cores on a one to one basis with writing-in" group chosen depend upon any number of factors mvolv' dependent upon the presence or absence of a written area at thg cost Pt of cells closeness machine languages that coordinate. Threshold devices may be employed to selec- Addmohat Spots may be added wlthout affecting the tively include or exclude partial strokes within cell areas.

operatmnITheSe addition?! Spots be utilized guidihg Coordinate stores 14 through 19 are coupled to coordinate t operation of the machme the spacing i devlce edn- 5O adder 20 which accumulates the totals shown in FIG. 4. Thus, Since these Spots are not for recognmon purposes for example, in the 8X8 matrix shown, 64 separate totals will they Wm h be dlscussed r be accumulated in the coordinate adder 20, the resultant ac- There is, however, a minimum number of points WhlCh decumulatlons each representing the sum of the characters at pend upon the number of characters to be analyzed. This the 64 coordinate points.

number isnwhere 2" the number ofcharacters.

A binary constant generator 30 generates a constant n TABLE V' COMBINATORIAL CHARTS where n is derived from the equation 2" e the number of characters. The number of characters in this case is 6 and n 2 spots sspots pots pots equals 3. Accordingly, the three-spot combination chart would be selected by store 37v Stores 29, 30...33...are provided, in each of which is stored information of the coordinate having the corresponding sum (i.e., l, 2, 3...p)', where p is the maximum necessary sum. As is shown in the figure, the stores 29, 30...33...are coupled to the configuration stores 14 through 19 in order to also provide memorization of the particular configuration which has written areas at the coordinate value. More specifically, and are made clear from figures 1, 3, 4 and 4a, the stores 29-37, e.g., the store 31 contains the identity of all coordinate sensing points (defined by an X- and a Y-coordinate) which sense a total of exactly three marked areas in the character set and, moreover, an ordering by each character of the character set The number n can then be employed to determine a specific indicating whether or not printed matter in that character concombinatorial chart delineating the possible permutations and tributes to the associated sum. Thus, for example, each of the combinations. Table V illustrates the charts for, respectively. coordinate sets Y,, X Y,,, X each sense printed matter in two, three and four points. For example. if four characters three characters of the assumed six-character set, wherein the were 10 be analyzed (including the blank it would be a! Hebrew letters identified by the English letters I), m and k contribute to the sum while the characters identified by the letters d, h and I do not so contribute. The pattern stored in the store 32 associated with the sum 4 is shown in the left portion of FIG. 4a wherein the same type of information is presented. FIG. 4a illustrates the contents of stores 31 and 32. The contents of other stores may be similarly arrived at from the data given in FIG. 4.

Coupled to each of the stores 29, 30...33...are comparators 39, 40...43..., respectively, which act to compare information within any store intra se. This comparison effects the information shown in FIG. 4a wherein a determination is made of which information is duplicate. Thus, for example, as may be seen from FIG. 4a all of the sum equals 3 set information is redundant and hence, only one may be used as a representative for further processing. Comparator 41 will present one coordinate printed matter detection pattern for the character set for further processing.

In this example, we have chosen sum 3 as the beginning upon the premise that sums l and 2 (29 and 30) have been either manually withdrawn or have been found ineffective by the apparatus.

Comparator selector 50 combines first the lowest store identification sum (3) to see whether in fact three mutually distinct points exist to unambiguously identify the characters. Since in this case it does not, the comparator selector now chooses the points available from the sum (4) store (again, only the nonduplicative points thereof) and these are staged with the sum (3) information in all possible permutations and combinations as shown in FIG. 5.

As will be appreciated, the introduction of a comparison between sum-3 and sum-4 inter se produces other combinations in which ambiguous readings may be effected. Thus, for example, in the first grouping in FIG. 5, d and I would be ambiguously determined. Comparator selector 50, therefore, chooses one grouping (for example, group I 1, sum 3, 4, 4) of identification points or one identification set for unambiguously identifying the characters.

FIG. 6 illustrates the example in which four identification spots are chosen to identify the letters. This choice could obtain, for example, when none of the combinations of FIG. 5 effect the desired result, or for other design purposes. In this case, a manual input to the selector 50 could be triggered in order to effect the new selection logic as shown in FIG. 6. Alternatively, the apparatus could merely be programmed to add one to the combination store and repeat the sequence.

Output device 60, which may be any type computer readout visually indicates the coordinates of the identification spots. Reading devices may now be manufactured specifically (as shown in FIG. 2a) for the six Hebrew letters. The reading machines will be described hereinafter.

FIGS. 9 and I illustrate the result of the application of the process to a specific type font for numerals and English letters, respectively.

FIGS. 2a through 2d are schematic illustrations of reading machines and components which may be employed in conjunction with the above apparatus or independently to pick up the signals of the spots.

In FIG. 2a, a light source 53 illuminates a mat via a lens 54. The letter to be read 52 is disposed on the document and the document moved by conventional means (not shown) to cause either a scanning of the letter or to effect the positioning of the letter within the field of view of the read-head 56. Lens 55 focuses the image of the particular letter under consideration upon the read-head which comprises a group of photosensitive cells 57 which are led by wires 58 to box 59 for further processing. In the example shown, five photocells are arranged according to the identification points (assuming a fivespot combination chart is employed). The photocells are normally conductive and the projective image of the pattern being read will render the cells nonconductive or lower the voltage in a conventional manner below some predetermined threshold. The output of the cells will thus be binary signals which may then be lead through conventional logic circuitry which has been greatly simplified by the reduced number of photocells (by virtue of the invention).

It is understood that more cells may be employed to read the configurations by line or by page.

FIG. 2b shows a multipurpose read-head which includes a mosaic of ll l5 photocells. Each photocell is isolated and connected with an independent output lead. When the identification points have been derived, those cells are rendered operative which correspond with the identification points derived by the inventive method. Alternatively, a mask or other method may be employed to inactivate other photocells such that only those cells which correspond to the identification spots are rendered effective.

FIG. 2c shows another conventional arrangement. In this figure, the identification spots are picked out in a successive manner by apparatus such as a flying spot scanner, cathode ray tube cameras with photoemissive mosaics, fiber optics, or tiny diodes. The pattern to be recognized 62 is printed upon the mat 61 which is transmitted to the signal pickup camera 63 via lens 64. FIG. 20 is a detail of the spot scan. The signal output is available over line 65 and transmitted to stage 66 which is a selection stage wherein all the unnecessary currents are excluded and only those carrying information of the identification spots are selected. As will be appreciated by those skilled in the art, this greatly reduces the necessary bandwidth. Further processing takes place in a conventional manner via stage 67.

With a flying spot scanner, it is necessary to pick up a great deal of unnecessary signals. As an alternative, it is possible to use optical fibers to transmit the identification signals into a linear array for scanning. FlG. 2d illustrates the method wherein the fibers 73 conduct the light signals between jig 71 and line 74.

While the principles of the invention have been described in connection with specific apparatus, it is to be clearly understood that this description is made only by way of example and not as a limitation to the scope of the invention as set forth in the objects thereof and in the accompanying claims.

Thus, for example, were the configurations magnetically written, then the matrix would consist of magnetic rather than light transducers.

What is claimed is:

1. A process for obtaining group identification positions for a set of character configurations comprising the steps of:

scanning each of the configurations by a coordinate matrix of transducers;

recording the outputs of each transducer for each of said character configurations corresponding to the presence or absence of a written area in each character at each coordinate sampling point;

numerically adding the recorded instances of signals inflicting written area at each corresponding transducer position over said character set and preserving the additive results;

deriving a combinatorial constant n comprising the least value of n such that 2" z the number of character configurations;

deriving combinatorial arrays formed by the character configuration signal pattern of at least n transducers, each of said transducers detecting written area in at least n configurations; and

selecting one of said combinatorial arrays of said transducer positions sufficient for uniquely recognizing each of said characters, said selecting step including examining said combinatorial arrays for duplicative entries therein.

2. Apparatus for obtaining group identification positions for a set of character configurations comprising:

a coordinate matrix of transducers for scanning each of the configurations;

means for recording the outputs of each transducer for each of said character configurations corresponding to the presence or absence of a written area in each character at each coordinate sampling point;

means for numerically adding the recorded instances of signals inflicting written area at each corresponding transducer position over said character set and preserving the additive results;

means for deriving a combinatorial constant n comprising the least value of n such that 2" e the number of character configurations;

means for deriving combinatorial arrays formed by the character configuration signal pattern of at least n transducers, each of said transducers detecting written area in at least n configuration; and

means for selecting one of said combinatorial arrays of said transducer positions sufficient for uniquely recognizing each of said characters, said selecting means including means for examining said combinatorial arrays for duplicative entries therein.

3. In combination in a reading machine for reading characters of a predetermined character set comprising an array of character configurations, said machine comprising a plurality of operative transducers disposed at selected ones of a coordinate array of character sampling stations for unambiguously identifying each of said character configurations, and means coupled to said transducers for unambiguously identifying each character depending upon the signal pattern provided by said transducers, said transducer stations being selected by scanning each of the configurations by a coordinate matrix of transducers; recording the outputs of each transducer for each of said character configuration corresponding to the presence or absence of a written area in each character at each coordinate sampling point; numerically adding the recorded instances of signals inflicting written area at each corresponding transducer position over said character set and preserving the additive results; deriving a combinatorial constant It comprising the least value of n such that 2" 2 the number of character configurations; deriving combinatorial arrays formed by the character configuration signal pattern of at least n transducers, each of said transducers detecting written area in at least n configurations; and selecting one of said combinatorial arrays of said transducer positions sufficient for uniquely recognizing each of said characters, said selecting step including examining said combinatorial arrays for duplicative entries therein. 

1. A process for obtaining group identification positions for a set of character configurations comprising the steps of: scanning each of the configurations by a coordinate matrix of transducers; recording the outputs of each transducer for each of said character configurations corresponding to the presence or absence of a written area in each character at each coordinate sampling point; numerically adding the recorded instances of signals inflicting written area at each corresponding transducer position over said character set and preserving the additive results; deriving a combinatorial constant n comprising the least value of n such that 2n the number of character configurations; deriving combinatorial arrays formed by the character configuration signal pattern of at least n transducers, each of said transducers detecting written area in at least n configurations; and selecting one of said combinatorial arrays of said transducer positions sufficient for uniquely recognizing each of said characters, said selecting step including examining said combinatorial arrays for duplicative entries therein.
 2. Apparatus for obtaining group identification positions for a set of character configurations comprising: a coordinate matrix of transducers for scanning each of the configurations; means for recording the outputs of each transducer for each of said character configurations corresponding to the presence or absence of a written area in each character at each coordinate sampling points; means for numerically adding the recorded instances of signals inflicting written area at each corresponding transducer position over said character set and preserving the additive results; means for deriving a combinatorial constant n comprising the least value of n such that 2n the number of character configurations; means for deriving combinatorial arrays formed by the character configuration signal pattern of at least n transducers, each of said transducers detecting written area in at least n configuration; and means for selecting one of said combinatorial arrays of said transducer positions sufficient for uniquely recognizing each of said characters, said selecting means including means for examining said combinatorial arrays for duplicative entries therein.
 3. In combination in a reading machine for reading characters of a predetermined character set comprising an array of character configurations, said machine comprising a plurality of operative transducers disposed at selected ones of a coordinate array of character sampLing stations for unambiguously identifying each of said character configurations, and means coupled to said transducers for unambiguously identifying each character depending upon the signal pattern provided by said transducers, said transducer stations being selected by scanning each of the configurations by a coordinate matrix of transducers; recording the outputs of each transducer for each of said character configuration corresponding to the presence or absence of a written area in each character at each coordinate sampling point; numerically adding the recorded instances of signals inflicting written area at each corresponding transducer position over said character set and preserving the additive results; deriving a combinatorial constant n comprising the least value of n such that 2n the number of character configurations; deriving combinatorial arrays formed by the character configuration signal pattern of at least n transducers, each of said transducers detecting written area in at least n configurations; and selecting one of said combinatorial arrays of said transducer positions sufficient for uniquely recognizing each of said characters, said selecting step including examining said combinatorial arrays for duplicative entries therein. 